RSS

RSS

The feed icon used in several browsers
Filename extension .rss, .xml
Internet media type application/rss+xml (registration not finished)[1]
Type of format Web syndication
Extended from XML

RSS (originally RDF Site Summary, often dubbed Really Simple Syndication) is a family of web feed formats used to publish frequently updated works—such as blog entries, news headlines, audio, and video—in a standardized format.[2] An RSS document (which is called a "feed", "web feed",[3] or "channel") includes full or summarized text, plus metadata such as publishing dates and authorship.

RSS feeds benefit publishers by letting them syndicate content automatically. A standardized XML file format allows the information to be published once and viewed by many different programs. They benefit readers who want to subscribe to timely updates from favorite websites or to aggregate feeds from many sites into one place.

RSS feeds can be read using software called an "RSS reader", "feed reader", or "aggregator", which can be web-based, desktop-based, or mobile-device-based. The user subscribes to a feed by entering into the reader the feed's URI or by clicking a feed icon in a web browser that initiates the subscription process. The RSS reader checks the user's subscribed feeds regularly for new work, downloads any updates that it finds, and provides a user interface to monitor and read the feeds. RSS allows users to avoid manually inspecting all of the websites they are interested in, and instead subscribe to websites such that all new content is pushed onto their browsers when it becomes available.

Contents

History

The RSS formats were preceded by several attempts at web syndication that did not achieve widespread popularity. The basic idea of restructuring information about websites goes back to as early as 1995, when Ramanathan V. Guha and others in Apple Computer's Advanced Technology Group developed the Meta Content Framework.[4]

RDF Site Summary, the first version of RSS, was created by Dan Libby and Ramanathan V. Guha at Netscape. It was released in March 1999 for use on the My.Netscape.Com portal. This version became known as RSS 0.9.[5] In July 1999, Dan Libby of Netscape produced a new version, RSS 0.91,[2] which simplified the format by removing RDF elements and incorporating elements from Dave Winer's scriptingNews syndication format.[6] Libby also renamed RSS Rich Site Summary and outlined further development of the format in a "futures document".[7]

This would be Netscape's last participation in RSS development for eight years. As RSS was being embraced by web publishers who wanted their feeds to be used on My.Netscape.Com and other early RSS portals, Netscape dropped RSS support from My.Netscape.Com in April 2001 during new owner AOL's restructuring of the company, also removing documentation and tools that supported the format.[8]

Two entities emerged to fill the void, with neither Netscape's help nor approval: The RSS-DEV Working Group and Winer, whose UserLand Software had published some of the first publishing tools outside of Netscape that could read and write RSS.

Winer published a modified version of the RSS 0.91 specification on the UserLand website, covering how it was being used in his company's products, and claimed copyright to the document.[9] A few months later, UserLand filed a U.S. trademark registration for RSS, but failed to respond to a USPTO trademark examiner's request and the request was rejected in December 2001.[10]

The RSS-DEV Working Group, a project whose members included Guha and representatives of O'Reilly Media and Moreover, produced RSS 1.0 in December 2000.[11] This new version, which reclaimed the name RDF Site Summary from RSS 0.9, reintroduced support for RDF and added XML namespaces support, adopting elements from standard metadata vocabularies such as Dublin Core.

In December 2000, Winer released RSS 0.92[12] a minor set of changes aside from the introduction of the enclosure element, which permitted audio files to be carried in RSS feeds and helped spark podcasting. He also released drafts of RSS 0.93 and RSS 0.94 that were subsequently withdrawn.[13]

In September 2002, Winer released a major new version of the format, RSS 2.0, that redubbed its initials Really Simple Syndication. RSS 2.0 removed the type attribute added in the RSS 0.94 draft and added support for namespaces. To preserve backward compatibility with RSS 0.92, namespace support applies only to other content included within an RSS 2.0 feed, not the RSS 2.0 elements themselves.[14] (Although other standards such as Atom attempt to correct this limitation, RSS feeds are not aggregated with other content often enough to shift the popularity from RSS to other formats having full namespace support.)

Because neither Winer nor the RSS-DEV Working Group had Netscape's involvement, they could not make an official claim on the RSS name or format. This has fueled ongoing controversy in the syndication development community as to which entity was the proper publisher of RSS.

One product of that contentious debate was the creation of an alternative syndication format, Atom, that began in June 2003.[15] The Atom syndication format, whose creation was in part motivated by a desire to get a clean start free of the issues surrounding RSS, has been adopted as IETF Proposed Standard RFC 4287.

In July 2003, Winer and UserLand Software assigned the copyright of the RSS 2.0 specification to Harvard's Berkman Center for Internet & Society, where he had just begun a term as a visiting fellow.[16] At the same time, Winer launched the RSS Advisory Board with Brent Simmons and Jon Udell, a group whose purpose was to maintain and publish the specification and answer questions about the format.[17]

In December 2005, the Microsoft Internet Explorer team[18] and Microsoft Outlook team[19] announced on their blogs that they were adopting the feed icon first used in the Mozilla Firefox browser (). In February 2006, Opera Software followed suit.[20] This effectively made the orange square with white radio waves the industry standard for RSS and Atom feeds, replacing the large variety of icons and text that had been used previously to identify syndication data.

In January 2006, Rogers Cadenhead relaunched the RSS Advisory Board without Dave Winer's participation, with a stated desire to continue the development of the RSS format and resolve ambiguities. In June 2007, the board revised their version of the specification to confirm that namespaces may extend core elements with namespace attributes, as Microsoft has done in Internet Explorer 7. According to their view, a difference of interpretation left publishers unsure of whether this was permitted or forbidden.

Example

As RSS files are essentially XML formatted plain text, the RSS file itself is relatively easily read both by automated processes and by humans alike. An example file could have contents such as the following. This could be placed on any appropriate communication protocol for file retrieval, such as http or ftp, and reading software would use the information to present a neat display to the end user.

<?xml version="1.0" encoding="UTF-8" ?>
<rss version="2.0">
<channel>
	<title>RSS Title</title>
	<description>This is an example of an RSS feed</description>
	<link>http://www.someexamplerssdomain.com/main.html</link>
	<lastBuildDate>Mon, 06 Sep 2010 00:01:00 +0000 </lastBuildDate>
	<pubDate>Mon, 06 Sep 2009 16:45:00 +0000 </pubDate>
	<ttl>1800</ttl>
 
	<item>
		<title>Example entry</title>
		<description>Here is some text containing an interesting description.</description>
		<link>http://www.wikipedia.org/</link>
		<guid>unique string per item</guid>
		<pubDate>Mon, 06 Sep 2009 16:45:00 +0000 </pubDate>
	</item>
 
</channel>
</rss>

Variants

There are several different versions of RSS, falling into two major branches (RDF and 2.*).

The RDF (or RSS 1.*) branch includes the following versions:

The RSS 2.* branch (initially UserLand, now Harvard) includes the following versions:

Later versions in each branch are backward-compatible with earlier versions (aside from non-conformant RDF syntax in 0.90), and both versions include properly documented extension mechanisms using XML Namespaces, either directly (in the 2.* branch) or through RDF (in the 1.* branch). Most syndication software supports both branches. "The Myth of RSS Compatibility", an article written in 2004 by RSS critic and Atom advocate Mark Pilgrim, discusses RSS version compatibility issues in more detail.

The extension mechanisms make it possible for each branch to track innovations in the other. For example, the RSS 2.* branch was the first to support enclosures, making it the current leading choice for podcasting, and as of 2005 is the format supported for that use by iTunes and other podcasting software; however, an enclosure extension is now available for the RSS 1.* branch, mod_enclosure. Likewise, the RSS 2.* core specification does not support providing full-text in addition to a synopsis, but the RSS 1.* markup can be (and often is) used as an extension. There are also several common outside extension packages available, including a new proposal from Microsoft for use in Internet Explorer 7.

The most serious compatibility problem is with HTML markup. Userland's RSS reader—generally considered as the reference implementation—did not originally filter out HTML markup from feeds. As a result, publishers began placing HTML markup into the titles and descriptions of items in their RSS feeds. This behavior has become expected of readers, to the point of becoming a de facto standard, though there is still some inconsistency in how software handles this markup, particularly in titles. The RSS 2.0 specification was later updated to include examples of entity-encoded HTML; however, all prior plain text usages remain valid.

As of January 2007, tracking data from www.syndic8.com indicates that the three main versions of RSS in current use are 0.91, 1.0, and 2.0. Of these, RSS 0.91 accounts for 13 percent of worldwide RSS usage and RSS 2.0 for 67 percent, while RSS 1.0 has a 17 percent share.[22] These figures, however, do not include usage of the rival web feed format Atom. As of August 2008, the syndic8.com website is indexing 546,069 total feeds, of which 86,496 were some dialect of Atom and 438,102 were some dialect of RSS.[23]

Modules

The primary objective of all RSS modules is to extend the basic XML schema established for more robust syndication of content. This inherently allows for more diverse, yet standardized, transactions without modifying the core RSS specification.

To accomplish this extension, a tightly controlled vocabulary (in the RSS world, "module"; in the XML world, "schema") is declared through an XML namespace to give names to concepts and relationships between those concepts.

Some RSS 2.0 modules with established namespaces are:

Interoperability

Although the number of items in an RSS channel are theoretically not limited, some news aggregators do not support RSS files larger than 150KB (if all elements are provided on a new line, this size corresponds to approx. 2,800 lines).[24] For example, applications that rely on the Common Feed List of Windows might handle such files as if they were corrupt, and not open them. Interoperability can be maximized by keeping the file size under this limit.

BitTorrent and RSS

Several BitTorrent-based peer-to-peer applications also support RSS. Such feeds (also known as Torrent/RSS-es or Torrentcasts) allow client applications to download files automatically from the moment the RSS reader detects them (also known as Broadcatching).

Comparison with Atom

Both RSS and Atom are widely supported in all major consumer feed readers. RSS has gained widespread distribution through feed reader implementations earlier than Atom, which led to higher popularity. However, Atom has several advantages over RSS, such as less restrictive licensing, IANA registered MIME type, an XML namespace, support for relative URIs, and Relax NG support.[25] Technically, Atom should be considered the more advanced syndication format between the two.[26]

The following table summarizes those RSS elements that have their equivalents in Atom.

RSS 2.0 Atom 1.0
author author
category category
channel feed
copyright rights
description subtitle
description summary and/or content
generator generator
guid id
image logo
item entry
lastBuildDate (in channel) updated
link link
managingEditor author or contributor
pubDate published (subelement of entry)
title title
ttl -

See also

References

  1. ^ "The application/rss+xml Media Type". Network Working Group. May 22, 2006. http://tools.ietf.org/id/draft-nottingham-rss-media-type-00.txt. Retrieved 2007-08-16. 
  2. ^ a b Libby, Dan (1999-07-10). "RSS 0.91 Spec, revision 3". Netscape Communications. Archived from the original on 2000-12-04. http://web.archive.org/web/20001204093600/http://my.netscape.com/publish/formats/rss-spec-0.91.html. Retrieved 2007-02-14. 
  3. ^ "Web feeds | RSS | The Guardian | guardian.co.uk", The Guardian, London, 2008, webpage: GuardianUK-webfeeds.
  4. ^ Lash, Alex (1997-10-03). "W3C takes first step toward RDF spec". http://news.com.com/2100-1001-203893.html. Retrieved 2007-02-16. 
  5. ^ "My Netscape Network: Quick Start". Netscape Communications. Archived from the original on 2000-12-08. http://web.archive.org/web/20001208063100/http://my.netscape.com/publish/help/quickstart.html. Retrieved 2006-10-31. 
  6. ^ RSS Advisory Board (June 7, 2007). "RSS History". http://www.rssboard.org/rss-history. Retrieved 2007-09-04. 
  7. ^ "MNN Future Directions". Netscape Communications. Archived from the original on 2000-12-04. http://web.archive.org/web/20001204123600/http://my.netscape.com/publish/help/futures.html. Retrieved 2006-10-31. 
  8. ^ Andrew King (2003-04-13). "The Evolution of RSS". http://www.webreference.com/authoring/languages/xml/rss/1/. Retrieved 2007-01-17. 
  9. ^ Winer, Dave (2000-06-04). "RSS 0.91: Copyright and Disclaimer". UserLand Software. http://backend.userland.com/rss091#copyrightAndDisclaimer. Retrieved 2006-10-31. 
  10. ^ U.S. Patent & Trademark Office. "'RSS' Trademark Latest Status Info". http://tarr.uspto.gov/servlet/tarr?regser=serial&entry=78025336. 
  11. ^ RSS-DEV Working Group (2000-12-09). "RDF Site Summary (RSS) 1.0". http://web.resource.org/rss/1.0/spec. Retrieved 2006-10-31. 
  12. ^ Winer, Dave (2000-12-25). "RSS 0.92 Specification". UserLand Software. http://backend.userland.com/rss092. Retrieved 2006-10-31. 
  13. ^ Winer, Dave (2001-04-20). "RSS 0.93 Specification". UserLand Software. http://backend.userland.com/rss093. Retrieved 2006-10-31. 
  14. ^ Harvard Law (2007-04-14). "Top-level namespaces". http://cyber.law.harvard.edu/rss/toplevelNamespace.html. Retrieved 2009-08-03. 
  15. ^ Festa, Paul (2003-08-04). "Dispute exposes bitter power struggle behind Web logs". news.cnet.com. http://news.cnet.com/Battle-of-the-blog/2009-1032_3-5059006.html. Retrieved 2008-08-06. "The conflict centers on something called Really Simple Syndication (RSS), a technology widely used to syndicate blogs and other Web content. The dispute pits Harvard Law School fellow Dave Winer, the blogging pioneer who is the key gatekeeper of RSS, against advocates of a different format." 
  16. ^ "Advisory Board Notes". RSS Advisory Board. 2003-07-18. http://www.rssboard.org/advisory-board-notes. Retrieved 2007-09-04. 
  17. ^ "RSS 2.0 News". Dave Winer. http://www.scripting.com/2003/07/18.html#rss20News. Retrieved 2007-09-04. 
  18. ^ Icons: It’s still orange, Microsoft RSS Blog, December 14, 2005
  19. ^ RSS icon goodness, blog post by Michael A. Affronti of Microsoft (Outlook Program Manager), December 15, 2005
  20. ^ "Making love to the new feed icon". Opera Desktop Team. 2006-02-16. http://my.opera.com/desktopteam/blog/show.dml/146296. Retrieved 2010-07-04. 
  21. ^ "Namespaces in XML 1.0" (2nd ed.). W3C. August 16, 2006. http://www.w3.org/TR/REC-xml-names/. 
  22. ^ Holzner, Steven. "Peachpit article". Peachpit article. http://www.peachpit.com/articles/article.aspx?p=674690. Retrieved 2010-12-11. 
  23. ^ "Syndic8 stats table". Syndic8.com. http://www.syndic8.com/stats.php?Section=feeds#tabtable. Retrieved 2011-08-12. 
  24. ^ [webdevelopmentadelaide.com#didyouknow "Did you know?"]. webdevelopmentadelaide.com#didyouknow. Retrieved 2011-11-21. 
  25. ^ Sikos, Leslie (2011). Web standards - Mastering HTML5, CSS3, and XML. Apress. ISBN 978-1-4302-4041-9. http://www.masteringhtml5css3.com. 
  26. ^ Sikos, Leslie. "RSS tutorial". http://www.lesliesikos.com/tutorials/rss/. 

External links